Exploiting News to Categorize Tweets: Quantifying the Impact of Different News Collections
نویسندگان
چکیده
Short texts, due to their nature which makes them full of abbreviations and new coined acronyms, are not easy to classify. Text enrichment is emerging in the literature as a potentially useful tool. This paper is a part of a longer term research that aims at understanding the effectiveness of tweet enrichment by means of news, instead of the whole web as a knowledge source. Since the choice of a news collection may contribute to produce very different outcomes in the enrichment process, we compare the impact of three features of such collections: volume, variety, and freshness. We show that all three features have a significant impact on categorization accuracy.
منابع مشابه
Modeling the Impact of News on volatility The Case of Iran
In this paper various ARCH models and relevant news impact curves including a partially nonparametric (PNP) one are compared and estimated with daily Iran stock return data. Diagnostic tests imply the asymmetry of the volatility response to news. The EGARCH model, which passes all the tests and appears relatively matching with the asymmetry in the data, seems to be the most adequate characteriz...
متن کاملNews-Topic Oriented Hashtag Recommendation in Twitter Based on Characteristic Co-occurrence Word Detection
Hashtags, which started to be widely used since 2007, are always utilized to mark keywords in tweets to categorize messages and form conversation for topics in Twitter. However, it is hard for users to use hashtags for sharing their opinions/interests/comments for their interesting topics. In this paper, we present a new approach for recommending news-topic oriented hashtags to help Twitter use...
متن کاملTweet-Recommender: Finding Relevant Tweets for News Articles
Twitter has become a prime source for disseminating news and opinions. However, the length of tweets prohibits detailed descriptions; instead, tweets sometimes contain URLs that link to detailed news articles. In this paper, we devise generic techniques for recommending tweets for any given news article. To evaluate and compare the different techniques, we collected tens of thousands of tweets ...
متن کاملA Study on News Anchors’ Meta-Language and Non-Verbal Factors and their Impact on Audiences
Non-verbal communication or body messaging occurs when facial expressions, tone of voice, head and neck movements, smiling and ... affects others; which may be intentional or unintentional. Farhangi in nonverbal communication: the art of using movement and sound” defines this field as such: "Non-verbal communication is phonetic and non-phonetic messages which have been explained by other than l...
متن کاملDistant Supervision for Topic Classification of Tweets in Curated Streams
We tackle the challenge of topic classication of tweets in the context of analyzing a large collection of curated streams by news outlets and other organizations to deliver relevant content to users. Our approach is novel in applying distant supervision based on semi-automatically identifying curated streams that are topically focused (for example, on politics, entertainment, or sports). ese ...
متن کامل